(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1

TAGTGGATCC CCCGGGCTGC AGGAATTCCA GTTTAATACT AACCCTAATG 60



TGTGACTGCG GTTTACAAAG AGCTCTGTAT CACCTGGGAT AGCTTTCAGT AGCAATTCAC 120

TACAACTGGT CCTAAAAAAT AATAACAATA ATAATAATAA TTAGAGAATT AAAACCCAAC 180

AGCATGTTGA ATGGTTAAAA TCACGTAAGA ACTGAAATTT GGGGTGGGGG TGTCCTCAAC 240

AGCTGAGCTT GTCCTAGCAG TGAAAATGCT CGCCTCCAAG CAGGGCTCAG AAAGGTCTGG 300

AGCCCTCCAG GCAGAGGGCT GAGCTCAGGG GGCTCTTGGA GGACACTCAC CCCATGGTCC 360

ATGGGATGCT TCTGGCTTCC TTAAAAACAG TTGGGCATCC GCATTGTATA AGTAGGTGGA 420

GACCCTAGTG TGGTTCTTTT GAAGGATATG GGAAGGGAGG ATGACGAACT AGAGAAGTGG 480

GAGGGGACCA AAATCACTGA GGTCCCAGAA TATCATAGAT TTGGGTATAG GATTGGGGTC 540

ACTAAGAATT GAGCACCAGG AATTCCAGCT TCTTCCCATT AAAGAAACTG GGACTGGTTT 600

TGCCTTGGAG GCCTATGTAG TGTTTTCTGC CCCTGTCCCA TACCAAGTCT CATTGATATT 660

TCTGCAGAAT ATCAGATGAA AATCTATTTC TAAAGACCAT TGGGAGAATG GGTGGTGGAG 720

AAGGAGTTGG AGTGGGGTTG GGGGGCAGTT AAAAATGAAT AAAAATCTCT CAGCTACAGA 780

ACCCAAACAT CACTTCCCTC CGCATTCACA GCATTTCCCA GCAGTCCCCA GATGGTTGTT 840

TCCGTGGGGA CACAGCAGCT GCCTCATTTC CCTTCAGGCC CCATGGGCTG CTGGTCAACC 900

TCAGGATCTA CTAAAGATGA CGCAAATGCC GACTGAACAA TCTGAAACCC AAAGGACTCG 960

AGGAGAGACA TGTTCTGCTG AGGAGAGAAA GGTGAGCCAA GGGCAGGGCC CAGGTCCCCC 1020

AGGGGGCCCC CGAGAGCCCG GACATGCACC TTCTGGATGT GTTTGTTCAA GTAGGACTTA 1080

GAGCGGAAGA AGCTCCCACA TTCAGGGCAT GGGTACTTCT TCTCCCCATC AGACTCCATT 1140

TTGTTTTTGG GGACTGCCAT GTCGCAGGAG AAAGAGCCAT TGGCACTCTG CTTCTCTGGC 1200

GTCTTCAGGT CGCTGGCATC TGAGAGGTCA CCATAGGAGT CAGAGCTCTC AATCGGATCC 1260

TGATGTGAGC ATTTCTGGCC TTCTCGGTTA CAGATACTGC AGAAGTTGCT GGGCCCCTCG 1320

CTGTGCTTCT TCAGGTGGTC TGCCATGTAT GCTGCCCGCA AGTACTTCCC ACACACCTGG 1380

CAGGGCACCT TGTCTTC ATG ACA GGC CAG GTG GGA GCG CAG ACG GTC TCG 1430
Met Thr Gly Gln Val Gly Ala Gln Thr Val Ser
1 5 10



GGT GGC AAA AGA AGC ATT GCA GGT CTG ACA CTT GTG AGG CCG CTC AGA 1478
Gly Gly Lys Arg Ser Ile Ala Gly Leu Thr Leu Val Arg Pro Leu Arg
15 20 25

AGT GTG CAC CTG CTT GAT ATG TCC GTT CAA GTG ATC AGG CCT GGA GAA 1526
Ser Val His Leu Leu Asp Met Ser Val Gln Val Ile Arg Pro Gly Glu
30 35 40

GCC TTT CCC ACA GCT CTG GCA GAT GTA AGG CGG AAT TCC CCA GAG AAG 1574
Ala Phe Pro Thr Ala Leu Ala Asp Val Arg Trp Asn Ser Pro Glu Lys
45 50 55

AAG GGT GGT GAA GAC TCC CGG CTC TCA GCT GCC CCC TGC ATC AGA CCC 1622
Lys Gly Gly Glu Asp Ser Trp Leu Ser Ala Ala Pro Cys Ile Arg Pro
60 65 70 75

AGC AGC TCC CCT CCC ACT GTG GCT CCC GCA TCT GCC TCC CTG CCC CAG 1670
Ser Ser Ser Pro Pro Thr Val Ala Pro Ala Ser Ala Ser Leu Pro Gln
80 85 90

CCC ATC CTC TCT AAC CAA GGA ATC ATG TTC GTT CAG GAG GAG GCC CTG 1718
Pro Ile Leu Ser Asn Gln Gly Ile Met Phe Val Gln Glu Glu Ala Leu
95 100 105



GCC AGC AGC CTC TCG TCC ACT GAC AGT CTG ACT CCC GAG CAC CAG CCC 1766
Ala Ser Ser Leu Ser Ser Thr Asp Ser Leu Thr Pro Glu His Gln Pro
110 115 120

ATT GCC CAG GGA TGT TCT GAT TCC TTG GAG TCC ATC CCT GCG GGA CAG 1814
Ile Ala Gln Gly Cys Ser Asp Ser Leu Glu Ser Ile Pro Ala Gly Gln
125 130 135

GCA GCT TCC GAT GAT TTA AGG GAC GTG CCA GGA GCT GTT GGT GGT GCA 1862
Ala Ala Ser Asp Asp Leu Arg Asp Val Pro Gly Ala Val Gly Gly Ala
140 145 150 155

AGC CCA GAA CAT GCC GAG CCG GAG GTC CAG GTG GTG CCG GGG TCT GGC 1910
Ser Pro Glu His Ala Glu Pro Glu Val Gln Val Val Pro Gly Ser Gly
160 165 170

CAG ATC ATC TTC CTG CCC TTC ACC TGC ATT GGC TAC ACG GCC ACC AAT 1958
Gln Ile Ile Phe Leu Pro Phe Thr Cys Ile Gly Tyr Thr Ala Thr Asn
175 180 185

CAG GAC TTC ATC CAG CGC CTG AGC ACA CTG ATC CGG CAG GCC ATC GAG 2006
Gln Asp Phe Ile Gln Arg Leu Ser Thr Leu Ile Trp Gln Ala Ile Glu
190 195 200

CGG CAG CTG CCT GCC TGG ATC GAG GCT GCC AAC CAG CGG GAG GAG GGC 2054
Trp Gln Leu Pro Ala Trp Ile Glu Ala Ala Asn Gln Trp Glu Glu Gly
205 210 215

CAG GGT GAA CAG GGC GAG GAG GAG GAT GAG GAG GAG GAA GAA GAG GAG 2102
Gln Gly Glu Gln Gly Glu Glu Glu Asp Glu Glu Glu Glu Glu Glu Glu
220 225 230 235

GAC GTG GCT GAG AAC CGC TAC TTT GAA ATG GGG CCC CCA GAC GTG GAG 2150
Asp Val Ala Glu Asn Arg Tyr Phe Glu Met Gly Pro Pro Asp Val Glu
240 245 250

GAG GAG GAG GGA GGA GGC CAG GGG GAG GAA GAG GAG GAG GAA GAG GAG 2198
Glu Glu Glu Gly Gly Gly Gln Gly Glu Glu Glu Glu Glu Glu Glu Glu
255 260 265

GAT GAA GAG GCC GAG GAG GAG CGC CTG GCT CTG GAA TGG GCC CTG GGC 2246
Asp Glu Glu Ala Glu Glu Glu Arg Leu Ala Leu Glu Trp Ala Leu Gly
270 275 280

GCG GAC GAG GAC TTC CTG CTG GAG CAC ATC CGC ATC CTC AAG GTG CTG 2294
Ala Asp Glu Asp Phe Leu Leu Glu His Ile Arg Ile Leu Lys Val Leu
285 290 295

TGG TGC TTC CTG ATC CAT GTG CAG GGC AGT ATC CGC CAG TTC GCC GCC 2342
Trp Cys Phe Leu Ile His Val Gln Gly Ser Ile Arg Gln Phe Ala Ala
300 305 310 315



TGC CTT GTG CTC ACC GAC TTC GGC ATC GCA GTC TTC GAG ATC CCG CAC 2390
Cys Leu Val Leu Thr Asp Phe Gly Ile Ala Val Phe Glu Ile Pro His
320 325 330

CAG GAG TCT CGG GGC AGC AGC CAG CAC ATC CTC TCC TCC CTG CGC TTT 2438
Gln Glu Ser Trp Gly Ser Ser Gln His Ile Leu Ser Ser Leu Arg Phe
335 340 345

GTC TTT TGC TTC CCG CAT GGC GAC CTC ACC GAG TTT GGC TTC CTC ATG 2486
Val Phe Cys Phe Pro His Gly Asp Leu Thr Glu Phe Gly Phe Leu Met
350 355 360

CCG GAG CTG TGT CTG GTG CTC AAG GTA CGG CAC AGT GAG AAC ACG CTC 2534
Pro Glu Leu Cys Leu Val Leu Lys Val Arg His Ser Glu Asn Thr Leu
365 370 375

TTC ATT ATC TCG GAC GCC GCC AAC CTG CAC GAG TTC CAC GCG GAC CTG 2582
Phe Ile Ile Ser Asp Ala Ala Asn Leu His Glu Phe His Ala Asp Leu
380 385 390 395

CGC TCA TGC TTT GCA CCC CAG CAC ATG GCC ATG CTG TGT AGC CCC ATC 2630
Arg Ser Cys Phe Ala Pro Gln His Met Ala Met Leu Cys Ser Pro Ile
400 405 410

CTC TAC GGC AGC CAC ACC AGC CTG CAG GAG TTC CTG CGC CAG CTG CTC 2678
Leu Tyr Gly Ser His Thr Ser Leu Gln Glu Phe Leu Arg Gln Leu Leu
415 420 425

ACC TTC TAC AAG GTG GCT GGC GGC TGC CAG GAG CGC AGC CAG GGC TGC 2726
Thr Phe Tyr Lys Val Ala Gly Gly Cys Gln Glu Arg Ser Gln Gly Cys
430 435 440

TTC CCC GTC TAC CTG GTC TAC AGT GAC AAG CGC ATG GTG CAG ACG GCC 2774
Phe Pro Val Tyr Leu Val Tyr Ser Asp Lys Arg Met Val Gln Thr Ala
445 450 455

GCC GGG GAC TAC TCA GGC AAC ATC GAG TGG GCC AGC TGC ACA CTC TGT 2822
Ala Gly Asp Tyr Ser Gly Asn Ile Glu Trp Ala Ser Cys Thr Leu Cys
460 465 470 475

TCA GCC GTG CGG CGC TCC TGC TGC GCG CCC TCT GAG GCC GTC AAG TCC 2870
Ser Ala Val Arg Arg Ser Cys Cys Ala Pro Ser Glu Ala Val Lys Ser
480 485 490

GCC GCC ATC CCC TAC TGG CTG TTG CTC ACG CCC CAG CAC CTC AAC GTC 2918
Ala Ala Ile Pro Tyr Trp Leu Leu Leu Thr Pro Gln His Leu Asn Val
495 500 505

ATC AAG GCC GAC TTC AAC CCC ATG CCC AAC CGT GGC ACC CAC AAC TGT 2966
Ile Lys Ala Asp Phe Asn Pro Met Pro Asn Arg Gly Thr His Asn Cys
510 515 520



CGC AAC CGC AAC AGC TTC AAG CTC AGC CGT GTG CCG CTC TCC ACC GTG 3014
Arg Asn Arg Asn Ser Phe Lys Leu Ser Arg Val Pro Leu Ser Thr Val
525 530 535

CTG CTG GAC CCC ACA CGC AGC TGT ACC CAG CCT CGG GGC GCC TTT GCT 3062
Leu Leu Asp Pro Thr Arg Ser Cys Thr Gln Pro Arg Gly Ala Phe Ala
540 545 550 555

GAT GGC CAC GTG CTA GAG CTG CTC GTG GGG TAC CGC TTT GTC ACT GCC 3110
Asp Gly His Val Leu Glu Leu Leu Val Gly Tyr Arg Phe Val Thr Ala
560 565 570

ATC TTC GTG CTG CCC CAC GAG AAG TTC CAC TTC CTG CGC GTC TAC AAC 3158
Ile Phe Val Leu Pro His Glu Lys Phe His Phe Leu Arg Val Tyr Asn
575 580 585

CAG CTG CGG GCC TCG CTG CAG GAC CTG AAG ACT GTG GTC ATC GCC AAG 3206
Gln Leu Arg Ala Ser Leu Gln Asp Leu Lys Thr Val Val Ile Ala Lys
590 595 600

ACC CCC GGG ACG GGA GGC AGC CCC CAG GGC TCC TTT GCG GAT GGC CAG 3254
Thr Pro Gly Thr Gly Gly Ser Pro Gln Gly Ser Phe Ala Asp Gly Gln
605 610 615

CCT GCC GAG CGC AGG GCC AGC AAT GAC CAG CGT CCC CAG GAG GTC CCA 3302
Pro Ala Glu Arg Arg Ala Ser Asn Asp Gln Arg Pro Gln Glu Val Pro
620 625 630 635

GCA GAG GCT CTG GCC CCG GCC CCA GTG GAA GTC CCA GCT CCA GCC CCG 3350
Ala Glu Ala Leu Ala Pro Ala Pro Val Glu Val Pro Ala Pro Ala Pro
640 645 650

GAA TTC GAT ATC AAG CTT ATC GAT ACC GTC GAC CTG CAG 3389
Glu Phe Asp Ile Lys Leu Ile Asp Thr Val Asp Leu Gln
655 660 664



(2) INFORMATION FOR SEQ ID NO: 2 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1954 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 2 



ATGACAGGCC AGGTGGGAGC GCAGACGGTC TCGGGTGGCA AAAGAAGCAT TGCAGGTCTG 60
ACACTTGTGA GGCCGCTCAG AAGTGTGCAC CTGCTTGATA TGTCCGTTCA AGTGATCAGG 120
CCTGGAGAAG CCTTTCCCAC AGCTCTGGCA GATGTAAGGC GGAATTCCCC AGAGAAGAAG 180
GGTGGTGAAG ACTCCCGGCT CTCAGCTGCC CCCTGCATCA GACCCAGCAG CTCCCCTCCC 240
ACTGTGGCTC CCGCATCTGC CTCCCTGCCC CAGCCCATCC TCTCTAACCA AGGAATCATG 300
TTCGTTCAGG AGGAGGCCCT GGCCAGCAGC CTCTCGTCCA CTGACAGTCT GACTCCCGAG 360
CACCAGCCCA TTGCCCAGGG ATGTTCTGAT TCCTTGGAGT CCATCCCTGC GGGACAGGCA 420
GCTTCCGATG ATTTAAGGGA CGTGCCAGGA GCTGTTGGTG GTGCAAGCCC AGAACATGCC 480
GAGCCGGAGG TCCAGGTGGT GCCGGGGTCT GGCCAGATCA TCTTCCTGCC CTTCACCTGC 540
ATTGGCTACA CGGCCACCAA TCAGGACTTC ATCCAGCGCC TGAGCACACT GATCCGGCAG 600
GCCATCGAGC GGCAGCTGCC TGCCTGGATC GAGGCTGCCA ACCAGCGGGA GGAGGGCCAG 660
GGTGAACAGG GCGAGGAGGA GGATGAGGAG GAGGAAGAAG AGGAGGACGT GGCTGAGAAC 720
CGCTACTTTG AAATGGGGCC CCCAGACGTG GAGGAGGAGG AGGGAGGAGG CCAGGGGGAG 780
GAAGAGGAGG AGGAAGAGGA GGATGAAGAG GCCGAGGAGG AGCGCCTGGC TCTGGAATGG 840
GCCCTGGGCG CGGACGAGGA CTTCCTGCTG GAGCACATCC GCATCCTCAA GGTGCTGTGG 900
TGCTTCCTGA TCCATGTGCA GGGCAGTATC CGCCAGTTCG CCGCCTGCCT TGTGCTCACC 960
GACTTCGGCA TCGCAGTCTT CGAGATCCCG CACCAGGAGT CTCGGGGCAG CAGCCAGCAC 1020
ATCCTCTCCT CCCTGCGCTT TGTCTTTTGC TTCCCGCATG GCGACCTCAC CGAGTTTGGC 1080
TTCCTCATGC CGGAGCTGTG TCTGGTGCTC AAGGTACGGC ACAGTGAGAA CACGCTCTTC 1140
ATTATCTCGG ACGCCGCCAA CCTGCACGAG TTCCACGCGG ACCTGCGCTC ATGCTTTGCA 1200
CCCCAGCACA TGGCCATGCT GTGTAGCCCC ATCCTCTACG GCAGCCACAC CAGCCTGCAG 1260
GAGTTCCTGC GCCAGCTGCT CACCTTCTAC AAGGTGGCTG GCGGCTGCCA GGAGCGCAGC 1320
CAGGGCTGCT TCCCCGTCTA CCTGGTCTAC AGTGACAAGC GCATGGTGCA GACGGCCGCC 1380
GGGGACTACT CAGGCAACAT CGAGTGGGCC AGCTGCACAC TCTGTTCAGC CGTGCGGCGC 1440
TCCTGCTGCG CGCCCTCTGA GGCCGTCAAG TCCGCCGCCA TCCCCTACTG GCTGTTGCTC 1500
ACGCCCCAGC ACCTCAACGT CATCAAGGCC GACTTCAACC CCATGCCCAA CCGTGGCACC 1560
CACAACTGTC GCAACCGCAA CAGCTTCAAG CTCAGCCGTG TGCCGCTCTC CACCGTGCTG 1620
CTGGACCCCA CACGCAGCTG TACCCAGCCT CGGGGCGCCT TTGCTGATGG CCACGTGCTA 1680
GAGCTGCTCG TGGGGTACCG CTTTGTCACT GCCATCTTCG TGCTGCCCCA CGAGAAGTTC 1740
CACTTCCTGC GCGTCTACAA CCAGCTGCGG GCCTCGCTGC AGGACCTGAA GACTGTGGTC 1800
ATCGCCAAGA CCCCCGGGAC GGGAGGCAGC CCCCAGGGCT CCTTTGCGGA TGGCCAGCCT 1860
GCCGAGCGCA GGGCCAGCAA TGACCAGCGT CCCCAGGAGG TCCCAGCAGA GGCTCTGGCC 1920
CCGGCCCCAG TGGAAGTCCC AGCTCCAGCC CCGG 1954



(3) INFORMATION FOR SEQ ID NO: 3 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3318 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 3 



AATTCCAGTT TAATACTAAC CCTAATGTGT GACTGCGGTT TACAAAGAGC TCTGTATCAC 60
CTGGGATAGC TTTCAGTAGC AATTCACTAC AACTGGTCCT AAAAAATAAT AACAATAATA 120
ATAATAATTA GAGAATTAAA ACCCAACAGC ATGTTGAATG GTTAAAATCA CGTAAGAACT 180
GAAATTTGGG GTGGGGGTGT CCTCAACAGC TGAGCTTGTC CTAGCAGTGA AAATGCTCGC 240
CTCCAAGCAG GGCTCAGAAA GGTCTGGAGC CCTCCAGGCA GAGGGCTGAG CTCAGGGGGC 300
TCTTGGAGGA CACTCACCCC ATGGTCCATG GGATGCTTCT GGCTTCCTTA AAAACAGTTG 360
GGCATCCGCA TTGTATAAGT AGGTGGAGAC CCTAGTGTGG TTCTTTTGAA GGATATGGGA 420
AGGGAGGATG ACGAACTAGA GAAGTGGGAG GGGACCAAAA TCACTGAGGT CCCAGAATAT 480
CATAGATTTG GGTATAGGAT TGGGGTCACT AAGAATTGAG CACCAGGAAT TCCAGCTTCT 540
TCCCATTAAA GAAACTGGGA CTGGTTTTGC CTTGGAGGCC TATGTAGTGT TTTCTGCCCC 600
TGTCCCATAC CAAGTCTCAT TGATATTTCT GCAGAATATC AGATGAAAAT CTATTTCTAA 660
AGACCATTGG GAGAATGGGT GGTGGAGAAG GAGTTGGAGT GGGGTTGGGG GGCAGTTAAA 720
AATGAATAAA AATCTCTCAG CTACAGAACC CAAACATCAC TTCCCTCCGC ATTCACAGCA 780
TTTCCCAGCA GTCCCCAGAT GGTTGTTTCC GTGGGGACAC AGCAGCTGCC TCATTTCCCT 840
TCAGGCCCCA TGGGCTGCTG GTCAACCTCA GGATCTACTA AAGATGACGC AAATGCCGAC 900
TGAACAATCT GAAACCCAAA GGACTCGAGG AGAGACATGT TCTGCTGAGG AGAGAAAGGT 960
GAGCCAAGGG CAGGGCCCAG GTCCCCCAGG GGGCCCCCGA GAGCCCGGAC ATGCACCTTC 1020
TGGATGTGTT TGTTCAAGTA GGACTTAGAG CGGAAGAAGC TCCCACATTC AGGGCATGGG 1080
TACTTCTTCT CCCCATCAGA CTCCATTTTG TTTTTGGGGA CTGCCATGTC GCAGGAGAAA 1140
GAGCCATTGG CACTCTGCTT CTCTGGCGTC TTCAGGTCGC TGGCATCTGA GAGGTCACCA 1200
TAGGAGTCAG AGCTCTCAAT CGGATCCTGA TGTGAGCATT TCTGGCCTTC TCGGTTACAG 1260
ATACTGCAGA AGTTGCTGGG CCCCTCGCTG TGCTTCTTCA GGTGGTCTGC CATGTATGCT 1320
GCCCGCAAGT ACTTCCCACA CACCTGGCAG GGCACCTTGT CTTCATGACA GGCCAGGTGG 1380
GAGCGCAGAC GGTCTCGGGT GGCAAAAGAA GCATTGCAGG TCTGACACTT GTGAGGCCGC 1440
TCAGAAGTGT GCACCTGCTT GATATGTCCG TTCAAGTGAT CAGGCCTGGA GAAGCCTTTC 1500
CCACAGCTCT GGCAGATGTA AGGCGGAATT CCCCAGAGAA GAAGGGTGGT GAAGACTCCC 1560
GGCTCTCAGC TGCCCCCTGC ATCAGACCCA GCAGCTCCCC TCCCACTGTG GCTCCCGCAT 1620
CTGCCTCCCT GCCCCAGCCC ATCCTCTCTA ACCAAGGAAT CATGTTCGTT CAGGAGGAGG 1680
CCCTGGCCAG CAGCCTCTCG TCCACTGACA GTCTGACTCC CGAGCACCAG CCCATTGCCC 1740
AGGGATGTTC TGATTCCTTG GAGTCCATCC CTGCGGGACA GGCAGCTTCC GATGATTTAA 1800
GGGACGTGCC AGGAGCTGTT GGTGGTGCAA GCCCAGAACA TGCCGAGCCG GAGGTCCAGG 1860
TGGTGCCGGG GTCTGGCCAG ATCATCTTCC TGCCCTTCAC CTGCATTGGC TACACGGCCA 1920
CCAATCAGGA CTTCATCCAG CGCCTGAGCA CACTGATCCG GCAGGCCATC GAGCGGCAGC 1980
TGCCTGCCTG GATCGAGGCT GCCAACCAGC GGGAGGAGGG CCAGGGTGAA CAGGGCGAGG 2040
AGGAGGATGA GGAGGAGGAA GAAGAGGAGG ACGTGGCTGA GAACCGCTAC TTTGAAATGG 2100
GGCCCCCAGA CGTGGAGGAG GAGGAGGGAG GAGGCCAGGG GGAGGAAGAG GAGGAGGAAG 2160
AGGAGGATGA AGAGGCCGAG GAGGAGCGCC TGGCTCTGGA ATGGGCCCTG GGCGCGGACG 2220
AGGACTTCCT GCTGGAGCAC ATCCGCATCC TCAAGGTGCT GTGGTGCTTC CTGATCCATG 2280
TGCAGGGCAG TATCCGCCAG TTCGCCGCCT GCCTTGTGCT CACCGACTTC GGCATCGCAG 2340
TCTTCGAGAT CCCGCACCAG GAGTCTCGGG GCAGCAGCCA GCACATCCTC TCCTCCCTGC 2400
GCTTTGTCTT TTGCTTCCCG CATGGCGACC TCACCGAGTT TGGCTTCCTC ATGCCGGAGC 2460
TGTGTCTGGT GCTCAAGGTA CGGCACAGTG AGAACACGCT CTTCATTATC TCGGACGCCG 2520
CCAACCTGCA CGAGTTCCAC GCGGACCTGC GCTCATGCTT TGCACCCCAG CACATGGCCA 2580
TGCTGTGTAG CCCCATCCTC TACGGCAGCC ACACCAGCCT GCAGGAGTTC CTGCGCCAGC 2640
TGCTCACCTT CTACAAGGTG GCTGGCGGCT GCCAGGAGCG CAGCCAGGGC TGCTTCCCCG 2700
TCTACCTGGT CTACAGTGAC AAGCGCATGG TGCAGACGGC CGCCGGGGAC TACTCAGGCA 2760
ACATCGAGTG GGCCAGCTGC ACACTCTGTT CAGCCGTGCG GCGCTCCTGC TGCGCGCCCT 2820
CTGAGGCCGT CAAGTCCGCC GCCATCCCCT ACTGGCTGTT GCTCACGCCC CAGCACCTCA 2880
ACGTCATCAA GGCCGACTTC AACCCCATGC CCAACCGTGG CACCCACAAC TGTCGCAACC 2940
GCAACAGCTT CAAGCTCAGC CGTGTGCCGC TCTCCACCGT GCTGCTGGAC CCCACACGCA 3000
GCTGTACCCA GCCTCGGGGC GCCTTTGCTG ATGGCCACGT GCTAGAGCTG CTCGTGGGGT 3060
ACCGCTTTGT CACTGCCATC TTCGTGCTGC CCCACGAGAA GTTCCACTTC CTGCGCGTCT 3120
ACAACCAGCT GCGGGCCTCG CTGCAGGACC TGAAGACTGT GGTCATCGCC AAGACCCCCG 3180
GGACGGGAGG CAGCCCCCAG GGCTCCTTTG CGGATGGCCA GCCTGCCGAG CGCAGGGCCA 3240
GCAATGACCA GCGTCCCCAG GAGGTCCCAG CAGAGGCTCT GGCCCCGGCC CCAGTGGAAG 3300
TCCCAGCTCC AGCCCCGG 3318



(4) INFORMATION FOR SEQ ID NO: 4 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 4 
GAGGAGGAGG AAGAGGAGGA TGAAGAGGCC GAGGAGGAGC GCCTGGCTCT GGAATGGGCC
CTGGGCGCGG ACGAGGACTT CCTGCTGGAG CACATCCGCA TCCTCAAGGT GCTGTGGTGC
TTCCTGATCC ATGTGCAGGG CAGTATCCGC CAGTTCGCCG CCTGCCTTGT GCTCACCGAC
TTCGGCATCG CAGTCTTCGA GATCCCGCAC CAGGAGTCTC GGGGCAGCAG CCAGCACATC
CTCTCCTCCC TGCGCTTTGT CTTTTGCTTC CCGCATGGCG ACCTCACCGA GTTTGGCTTC
CTCATGCCGG AGCTGTGTCT GGTGCTCAAG GTACGGCACA GTGAGAACAC GCTCTTCATT
ATCTCGGACG CCGCCAACCT GCACGAGTTC CACGCGGACC TGCGCTCATG CTTTGCACCC 420
CAGCACATGG CCATGCTGTG TAGCCCCATC CTCTACGGCA GCCACACCAG CCTGCAGGAG 480
TTCCTGCGCC AGCTGCTCAC CTTCTACAAG GTGGCTGGCG GCTGCCAGGA GCGCAGCCAG 540
GGCTGCTTCC CCGTCTACCT GGTCTACAGT GACAAGCGCA TGGTGCAGAC GGCCGCCGGG 600
GACTACTCAG GCAACATCGA GTGGGCCAGC TGCACACTCT GTTCAGCCGT GCGGCGCTCC 660
TGCTGCGCGC CCTCTGAGGC CGTCAAGTCC GCCGCCATCC CCTACTGGCT GTTGCTCACG 720
CCCCAGCACC TCAACGTCAT CAAGGCCGAC TTCAACCCCA TGCCCAACCG TGGCACCCAC 780
AACTGTCGCA ACCGCAACAG CTTCAAGCTC AGCCGTGTGC CGCTCTCCAC CGTGCTGCTG 840
GACCCCACAC GCAGCTGTAC CCAGCCTCGG GGCGCCTTTG CTGATGGCCA CGTGCTAGAG 900
CTGCTCGTGG GGTACCGCTT TGTCACTGCC ATCTTCGTGC TGCCCCACGA GAAGTTCCAC 960
TTCCTGCGCG TCTACAACCA GCTGCGGGCC TCGCTGCAGG ACCTGAAGAC TGTGGTCATC 1020
GCCAAGACCC CCGGGACGGG AGGCAGCCCC CAGGGCTCCT TTGCGGATGG CCAGCCTGCC 1080
GAGCGCAGGG CCAGCAATGA CCAGCGTCCC CAGGAGGTCC CAGCAGAGGC TCTGGCCCCG 1140
GCCCCAGTGG AAGTCCCAGC TCCAGCCCCG G



(5) INFORMATION FOR SEQ ID NO: 5

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 651 amino acids 

(B) TYPE: polypeptide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 5 

Met Thr Gly Gln Val Gly Ala Gln Thr Val Ser
1 5 10

Gly Gly Lys Arg Ser Ile Ala Gly Leu Thr Leu Val Arg Pro Leu Arg
15 20 25